Integrated Method of Information Search for Protein from Different Resource of Database

Computer and Modernization, 2012, Vol. 1, Issue (9): 232-234. doi: 10.3969/j.issn.1006-2475.2012.09.061

CHEN Ya-qi, ZHU Fei

School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215006, China

Received: 2012-08-17  Revised: 1900-01-01  Online: 2012-09-21  Published: 2012-09-21

Abstract

Protein research plays an increasingly important role in science. As technology advances rapidly, the volume of known protein information keeps growing and becomes more complex, so finding the required protein information efficiently and precisely in this mass of data is a problem that needs study. Taking NCBI and Binding DB as examples, this paper designs a method for integrating protein search information. Keywords are extracted from the retrieved entries to form two-tuples (bigrams), and the entries are grouped accordingly. Within each group, more specific keywords are extracted and the entries are grouped again. This cycle repeats, extending the two-tuples to N-tuples (N-grams), so that redundant information is removed and the remaining information is ordered and integrated.

Key words: protein search, keywords, bigram, information integration, NCBI, Binding DB
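The abstract gives only an outline of the grouping procedure, so the following is a minimal sketch under stated assumptions rather than the authors' implementation. It assumes that each retrieved entry is a plain title string, that keywords are lowercase tokens minus a small hypothetical stop-word list, that the first element of every tuple is the query term, and that two entries with the same keyword set count as redundant. The function names (`keywords`, `deduplicate`, `group`) and the sample titles are illustrative and do not come from NCBI, Binding DB, or the paper.

```python
from collections import Counter

# Hypothetical stop-word list; the paper does not specify one.
STOP_WORDS = {"the", "of", "and", "in", "for", "to", "with"}


def keywords(text):
    """Naive keyword extraction: lowercase alphanumeric tokens minus stop words."""
    tokens = "".join(c.lower() if c.isalnum() else " " for c in text).split()
    return [t for t in tokens if t not in STOP_WORDS]


def deduplicate(entries):
    """Drop entries whose keyword set was already seen (assumed redundancy test)."""
    seen, unique = set(), []
    for entry in entries:
        key = frozenset(keywords(entry))
        if key not in seen:
            seen.add(key)
            unique.append(entry)
    return unique


def group(entries, prefix, max_n):
    """Recursively group entries under keyword tuples of length up to max_n."""
    if len(prefix) >= max_n or len(entries) <= 1:
        return {prefix: entries}
    # For each keyword not already in the tuple, count the entries containing it.
    counts = Counter(k for e in entries for k in set(keywords(e)) if k not in prefix)
    result, remaining = {}, list(entries)
    # Most frequent keyword first; ties are broken alphabetically.
    for kw, _ in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        members = [e for e in remaining if kw in keywords(e)]
        if members:
            remaining = [e for e in remaining if e not in members]
            # Refine this group with a longer tuple (2-tuple, 3-tuple, ...).
            result.update(group(members, prefix + (kw,), max_n))
    if remaining:
        # Entries with no further keywords stay under the current tuple.
        result[prefix] = remaining
    return result


# Made-up sample titles standing in for results from two sources.
source_a = ["Protein kinase C alpha", "Protein kinase C beta", "Protein kinase A"]
source_b = ["protein kinase C alpha", "Tyrosine protein phosphatase"]

merged = deduplicate(source_a + source_b)
for key, members in group(merged, ("protein",), max_n=3).items():
    print(key, members)
```

In this sketch the groups come out in order of decreasing keyword frequency, which is one simple way to read the "ordering" step. A real system would replace the tokenizer and the redundancy test with ones suited to the record formats of the actual databases.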

Cite this article: CHEN Ya-qi, ZHU Fei. Integrated Method of Information Search for Protein from Different Resource of Database[J]. Computer and Modernization, 2012, 1(9): 232-234.

